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HIV-1  and  HIV-2  represent  two  distinct  groups  of  human  immunodeficiency  viruses 
known  to  cause  AIDS  in  infected  individuals  (3,11,12,21,43).  While  HIV-1  is  the  causative 
agent  of  epidemic  AIDS  worldwide,  HIV-2  has  generally  been  restricted  to  West  Africa 
(4,7,13,28,31).  Numerous  isolates  of  HIV-1  and  HIV-2  have  been  obtained  and  their  biological 
and  molecular  properties  characterized  (1 ,3,12,23,43,54).  Nucleotide  sequence  analysis  shows 
that  HIV-2  is  only  distantly  related  to  HIV-1  (23),  while  it  is  more  closely  related  to  two  primate 
retroviruses,  SIVMAC  and  SIVSM,  which  cause  an  AIDS-like  disease  in  captive  macaques 
(8.14,27).  Although  genetically  divergent,  prototype  HIV-1  and  HIV-2  viruses  share  similar 
overall  genomic  organization  and  have  similar  biological  properties,  which  include  their 
propensity  for  rapid  genetic  change  (18,45,54),  their  host  cell  tropism,  their  cytopathic  effect 
on  T-cell  cultures  and  peripheral  blood  mononuclear  cells  in  vitro,  and  their  ability  to  form 
syncytia  with  CD4-bearing  target  cells  (12,43).  In  fact,  the  majority  of  HIV-1  and  HIV-2  strains 
isolated  from  patients  with  immunodeficiency  disease  have  been  shown  to  cause  cell  fusion 
and  the  formation  of  multinucleated  giant  cells  in  culture.  This  represents  a  hallmark  of 
productive  viral  infection  in  tissue  culture  and  accounts  for  the  profound  in  vitro  cytopathic 
effects  of  HIV  (25,39,52).  In  contrast  to  HIV-1  and  HIV-2  isolated  from  clinically  ill  individuals, 
natural  infection  of  African  green  monkeys  by  SIV  occurs  with  high  prevalence  in  East  and 
West  Africa  but  does  not  lead  to  immunodeficiency.  In  addition,  there  is  growing  evidence  that 
the  biologic  properties  of  HIV-1  in  vivo  may  differ  significantly  from  tissue  culture  adapted 
strains.  Thus,  in  order  to  characterize  HIVs  and  SiVs  as  they  exist  in  vivo,  the  goals  for  year 
03  of  this  contract  were:  (i)  to  characterize  by  genetic  and  biologic  analysis  an  HIV-2  strain 
obtained  during  year  02  of  this  contract  which  showed  attenuated  virulence  in  vitro  and  in  vivo; 
(ii)  to  develop  PCR  approaches  that  would  allow,  for  the  first  time,  the  direct  identification, 
cloning,  and  sequence  analysis  of  SIV  and  HIV  from  uncultured  blood  of  West  African  green 
monkeys  and  humans  West  African  green  monkeys  and  humans  (all  previous  clones  of  SIV^ 
have  been  derived  from  East  African  green  monkeys);  (iii)  to  obtain  by  direct  lambda  phage 
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cloning  full-length  transfection  competent  clones  of  HIV-1  from  uncultured  human  brain  tissue. 
Molecular  Characterization  of  HIV-2/ST 

In  contrast  to  the  prototype  pathogenic  HIV-1  and  HIV-2  viruses,  we  and  others  have 
recently  isolated  less  cytopathic  strains  of  HIV-1  and  HIV-2  which  exhibit  markedly  different 
biological  properties  (2,9,16,34,36,50).  These  particular  isolates  cause  little  or  no  cell  death 
in  susceptible  target  cells,  fail  to  induce  cell  fusion  with  CD4-bearing  immortalized  T-cell  lines, 
exhibit  a  restricted  host  cell  tropism  with  a  preference  for  PBMC  and/or  macrophages,  and  are 
often  derived  from  asymptomatic  individuals.  While  their  in  vitro  biological  differences  are  well- 
documented,  the  genetic  changes  responsible  for  their  attenuated  phenotype  are  not 
understood.  In  order  to  elucidate  determinants  of  HIV  pathogenicity,  we  have  thus  begun  to 
molecularly  dissect  a  non-fusogenic  and  non-cytopathic  HIV-2  isolate,  termed  HIV-2/ST,  which 
was  obtained  from  a  healthy  Senegalese  prostitute  as  part  of  year  02  work  for  this  contract 
(34).  Although  this  virus  replicated  to  high  titers  in  tissue  culture,  it  infected  cells  at  a  slower 
rate  compared  to  cytopathic  strains  of  HIV-1  and  HIV-2,  and  caused  little  or  no  cell  kilting  and 
fusion.  This  was  the  case  despite  the  fact  that  its  external  envelope  glycoprotein  was  cleaved 
correctly,  transported  to  the  cell  surface,  and  shown  to  bind  to  a  specific  epitope  on  CD4, 
which  was  recognized  by  OKT4a  but  not  OKT4  antibodies  (34).  HIV-2/ST  therefore  appeared 
to  bind  to  the  CD4  molecule  analogous  to  other  HIVs,  but  it  failed  to  fuse  with  CD4-bearing 
target  cells  suggesting  that  its  infectivity  was  greatly  retarded  at  the  level  of  cell  entry  (34). 

Since  HIV  "isolates"  generally  represent  complex  mixtures  of  genotypically-distinct 
viruses,  and  since  the  biological  phenotype  of  any  HIV  culture  depends  on  the  sum  of  the 
properties  of  each  genotypic  variant  (22,45),  we  first  attempted  to  isolate  a  molecular  clone, 
which  was  both  transfection-competent  and  representative  of  the  in  vitro  properties  of  its 
parental  virus.  We  therefore  obtained  three  full-length  proviral  clones  (XJSP4-27,  AJSP4-32, 
and  AJSP4-34)  from  a  genomic  library  of  a  biologically-cloned  high  producer  cell  line,  termed 
ST/B12,  and  subsequently  transfected  them  into  the  neoplastic  T-cell  lines  SupTI  (49)  and 
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CEMx174  (29,47).  Reverse  transcriptase  activity  was  detected  in  supernatants  of  cultures 
transfected  with  XJSP4-27  as  early  as  five  days  post-transfection,  while  XJSP4-32  and 
XJSP4-34  transfected  cultures  revealed  no  signs  of  viral  replication,  indicating  that  these 
proviruses  were  replication-defective.  Immunofluorescence  analysis  further  confirmed  the 
presence  of  virus  expressing  cells  in  XJSP4-27  transfected  cultures,  but  failed  to  identify  virus 
mediated  cell  fusion.  Western  blot  analysis  of  purified  JSP4-27  virions  demonstrated  a  protein 
profile  similar  to  that  of  the  parental  HIV-2/ST  virus.  To  facilitate  subsequent  transfection 
experiments  and  to  allow  the  direct  comparison  of  JSP4-27  to  other  transfection-competent 
HIV-2  plasmid  constructs,  we  sub-cloned  the  proviral  insert  of  AJSP4-27  into  the  plasmid  vector 
pSP65  (Figure  1A). 

To  test  whether  the  transfection-derived  JSP4-27  virions  were  infectious,  filtered 
supernatants  of  plasmid-transfected  cultures  were  transmitted  to  uninfected  SupTI  and 
CEMx174  cells.  The  results  showed  that  cell-free  transmission  of  JSP4-27  virions  was  readily 
and  reproducibly  demonstrable.  However,  infection  and  spread  in  culture,  particularly  in  SupTI 
cells,  occurred  slowly  and  with  considerable  delay.  These  results  were  confirmed  and 
extended  in  comparative  studies  with  a  transfection-derived,  cytopathic  HIV-2/ROD  strain, 
termed  SL1  (38).  While  transfection  of  the  SL1  provirus  resulted  in  >90%  infected  SupTI  or 
CEMx174  cultures  within  three  to  four  days  post-transfection,  JSP4-27  transfected  cultures 
reached  only  10%  infectivity  in  the  same  time  period,  which  indicated  the  same  reduced  ability 
to  spread  in  culture  that  had  been  observed  for  the  parental  HIV-2/ST  virus  (34).  Similarly, 
the  transfection-derived  JSP4-27  cultures  did  not  form  syncytia  upon  cocultivation  with  several 
CD4-bearing  T-cell  lines,  including  SupTI,  CEM,  H9  and  CEMx174  cells,  whereas  SL1  infected 
cultures  produced  numerous  and  large  syncytia,  as  well  as  a  profound  cytopathic  effect,  with 
these  same  target  cells  (Figure  1 B).  These  data  thus  showed  that  the  JSP4-27  provirus  was 
replication-competent  and  infectious,  and  exhibited  the  same  non-fusogenic  and  non-cytopathic 
properties  as  previously  described  for  the  parental  virus  (34). 
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Having  identified  and  characterized  the  biological  features  of  a  molecular  clone  of 
HIV-2/ST,  we  next  sequenced  its  entire  genome.  The  complete  nucleotide  sequence  of  the 
JSP4-27  provirus  is  depicted  in  Figure  2.  The  viral  genome  is  9,672  bp  in  length  and  exhibits 
an  overall  genomic  organization  of  5’LTR-gag-pol-central  region-env-nef-3’LTR,  which  is 
identical  to  that  of  other  cytopathic  HIV-2  and  SIVMAC  proviruses.  It  contains  all  major  open 
reading  frames  characteristic  for  HIV-2,  including  vpx  which  is  present  in  HIV-2  and  SIVUAC 
viruses  but  not  in  HIV-1  (17,26,32,53),  and  vpr  which  is  present  in  HIV-1,  HIV-2  and  SIVMAC 
but  not  in  SIVAGM  viruses  (19).  Like  other  HIV/SIV  proviruses,  HIV-2/ST  is  flanked  by  LTR 
sequences  which  are  known  to  regulate  viral  gene  expression.  Sequence  comparison  with 
other  HIV-2  LTRs  showed  that  regulatory  elements,  like  the  tata  box,  the  polyadenylation  site, 
core  enhancer  sequences,  Spl  binding  sites,  and  the  tat  responsive  region  are  all  present  in 
HIV-2/ST  and  that  their  sequences  are  highly  conserved.  The  HIV-2/ST  LTR  is  of  similar 
length,  and  there  are  no  major  deletions  or  insertions  which  would  distinguish  it  from  the  LTRs 
of  other  cytopathic  HIV-2  viruses  (data  not  shown). 

Comparison  of  the  deduced  amino  acid  sequence  of  the  HIV-2/ST  reading  frames 
suggested  that  they  all  encoded  full-length  and  functional  proteins,  with  the  exception  of  the 
vpr  gene.  This  open  reading  frame  was  found  to  contain  an  in-frame  TAA  stop  codon  which 
truncates  the  vpr  protein  prematurely  after  the  first  32  amino  acid  residues.  Since  the  JSP4-27 
provirus  is  fully  replication-competent,  it  can  be  concluded  that  the  vpr  gene  product  is  not 
required  for  in  vitro  replication  of  HIV-2.  This  conclusion  was  confirmed  by  the  biological 
analysis  of  a  second  vpr-deficient  HIV-2  provirus  independently  constructed  in  our  laboratory 
(J.C.  Kappes  and  B.H.  Hahn,  unpublished),  as  well  as  by  the  findings  of  others  (15). 
Moreover,  since  vpr-deficient  proviruses  of  HIV-2  are  also  cytopathic  and  fusogenic,  it  is 
unlikely  that  the  lack  of  a  functional  vpr  gene  in  HIV-2/ST  is  responsible  for  its  attenuated 
phenotype. 

Pairwise  sequence  alignments  of  JSP4-27  to  other  cytopathic  strains  of  HIV-2  similarly 
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revealed  no  genetic  features  unique  for  HIV-2/ST.  Comparison  of  HIV-2/ST  and  HIV-2/ROD 
demonstrated  an  overall  sequence  divergence  of  11%,  which  is  within  the  expected  range  of 
genetic  variability  observed  among  geographically-distant  isolates  of  HIV-2  (Tables  1  and  2). 
Three  other  recently  reported  HIV-2  viruses,  HIV-2/ISY  derived  from  a  Gambian  individual  with 
AIDS  (1,18),  HIV-2/NIHz  derived  from  an  AIDS  patient  from  Guinea  Bissau  (54),  and  HIV- 
2/GH  derived  from  an  AIDS  patient  from  Ghana  (24,30)  differ  from  HIV-2/ROD  (Cape  Verde 
Islands,  refs.  12  and  23)  by  11%,  12%,  and  12%,  respectively.  Among  all  these  viruses,  the 
Senegalese  HIV-2/ST  virus  was  found  to  be  most  closely  related  to  the  Gambian  isolate 
HIV-2/ISY,  which  shared  90%  of  its  nucleotide  sequence  with  HIV-2/ST. 

Since  infectivity,  syncytia  formation,  and  cell  fusion  are  viral  properties  which  are 
mediated  by  the  viral  env  gene,  we  examined  this  gene  in  particular  with  respect  to  sequence 
differences  unique  for  HIV- 2/ST.  An  alignment  of  the  deduced  HIV-2/ST  env  sequence  to 
those  of  six  other  cytopathic  and  fusogenic  HIV-2  and  SIV  viruses  is  depicted  in  Figure  3. 
Overall,  the  size  of  the  various  env  sequences  compared  is  approximately  the  same.  In 
contrast  to  other  HIV-2  and  SIVMAC  viruses,  JSP4-27  contains  no  in-frame  stop  codon  in  its 
transmembrane  envelope  domain  (Figure  3).  This  is  consistent  with  the  presence  of  a  43kD 
rather  than  a  32kD  transmembrane  glycoprotein  on  Western  blots  of  JSP4-27  derived  virions 
(Figure  IB),  and  is  distinct  from  the  protein  profile  of  the  HIV-2/ST  parental  strain  which 
comprises  a  mixture  of  viruses  with  both  full-length  as  well  as  truncated  transmembrane 
proteins  (34).  Pairwise  sequence  alignment  shows  that  the  HIV-2/ST  env  sequence  differs 
from  other  HIV-2  and  SIV  envelopes  to  the  same  degree  as  they  differ  from  each  other,  with 
amino  acid  sequences  varying  between  16%  and  30%  (Table  1).  25  of  32  cysteine  residues, 
including  22  of  23  located  in  the  extracellular  domain,  are  conserved  among  all  viruses,  which 
indicate  a  highly  conserved  envelope  structure.  In  addition,  HIV-2/ST  contains  28  potential 
N-linked  glycosylation  sites  which  are  arranged  in  a  pattern  similar  to  that  of  other  viruses,  and 
which  also  include  one  highly  conserved  glycosylation  site  previously  shown  to  be  critical  for 
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HIV-1  infectivity  (51).  Finally,  the  HIV-2/ST  env  gene  contains  highly  variable  regions  which 
correspond  closely  in  distribution  and  size  to  similar  hypervariable  regions  in  the  other  env 
gene  sequences. 

Although  a  three-dimensional  structure  has  not  been  determined  for  any  HIV  or  SIV 
envelope  glycoproteins,  there  are  certain  envelope  domains  whose  functions  have  been 
characterized  by  mutagenesis  analysis.  These  include  the  putative  CD4  binding  domain 
(35,37),  the  envelope  precursor  cleavage  site  (40),  and  the  viral  fusion  sequence  (5,6,20,35). 
Since  sequence  changes  in  any  one  of  these  domains  could  alter  the  fusogenic  properties  of 
a  virus  (35),  we  analyzed  the  envelope  sequence  of  HIV-2/ST  for  particular  mutations  in  these 
areas.  No  changes,  or  only  conservative  amino  acid  changes  were  found  in  an  envelope 
domain  of  JSP4-27  that  corresponds  to  the  HIV-1  envelope  region  previously  identified  to  be 
involved  in  CD4  binding  (37).  HIV-2/ST  also  contained  an  apparently  functional  primary 
envelope  precursor  cleavage  site,  with  a  recognition  sequence  (RNKR)  identical  to  that  of  three 
other  fusogenic  HIV-2  or  SIV  viruses  (compare  Figure  3).  In  contrast  to  these  viruses, 
however,  HIV-2/ ST  was  found  to  differ  in  2  of  16  highly  conserved  amino  acid  residues  at  the 
N-terminus  of  the  transmembrane  envelope  glycoprotein  which,  as  shown  by  site-directed 
mutagenesis,  contains  the  viral  fusion  domain  (5).  The  mutations  include  an  alanine  to 
threonine  change  involving  amino  acid  residue  517  (position  12  after  the  cleavage  site),  and 
a  serine  to  alanine  change  involving  amino  acid  residue  521  (position  16  after  the  cleavage 
site).  Only  one  other  fusogenic  HIV-2  virus,  HIV-2/ISY,  contained  these  same  changes. 
However,  this  virus  exhibited  three  additional  mutations  in  this  same  envelope  area  (compare 
Figure  3).  Since  the  fusion  domain  is  generally  highly  conserved  among  cytopathic  HIV  and 
SIV  viruses  (5),  we  considered  the  possibility  that  the  non-fusogenic  properties  of  HIV-2/ST 
resulted  from  these  mutations. 

To  determine  if  the  observed  amino  acid  substitutions  in  the  HIV-2/ST  envelope  fusion 
region  were  likely  responsible  for  the  impaired  cytopathic  properties  of  this  virus,  we  examined 
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two  fusogenic  variants  of  HIV-2/ST,  termed  ST/24.1  C  and  ST/24.2C  (Figure  4).  Both  fusogenic 
strains  were  originally  derived  from  a  biologically-cloned  sub-culture  of  HIV-2/ST,  termed  ST/24, 
which  produced  non-cytopathic  and  non-fusogenic  virions  biologically  indistinguishable  from  the 
parental  HIV-2/ST  isolate  (34).  Following  serial  cell-free  transmissions  of  ST/24  supernatant 
to  uninfected  SupTI  cells,  large  and  numerous  syncytia  were  observed  on  two  independent 
occasions,  which  indicated  the  emergence  of  fusogenic  progeny  virus  in  the  culture.  Two  cell 
lines  were  subsequently  established  (ST/24. 1C  and  ST/24.2C)  and  shown  to  produce  virions 
with  fusogenic  and  cytopathic  properties  similar  to  prototype  HIV-1  and  HIV-2  viruses  (J.A. 
Hoxie,  et  al.,  in  preparation).  Moreover,  these  cell  lines  were  confirmed  to  be  infected  with 
HIV-2/ST  by  Southern  blot  analysis,  which  revealed  no  changes  in  their  BamHI,  Nhel,  Hindlll, 
and  Pstl  cleavage  patterns  as  compared  to  ST/24.  In  order  to  identify  the  molecular  basis  for 
the  phenotypical  change  in  these  variants  and  to  determine  whether  a  direct  mutation  of  the 
viral  fusion  sequence  had  occurred,  we  amplified  the  envelope  fusion  domain  of  these  cultures 
using  the  polymerase  chain  reaction  (46). 

Two  oligonucleotide  primers  (30  mers)  were  designed  to  allow  the  amplification  of  a  544 
bp  envelope  fragment  from  virally-infected  cellular  DNA,  which  included  the  putative  precursor 
cleavage  site  as  well  as  the  envelope  fusion  region  (Figures  5  and  6).  Both  primers  were 
synthesized  according  to  the  JSP4-27  sequence,  however,  sequence  changes  were  introduced 
so  as  to  accommodate  a  BamHI  site  in  the  5’  amplimer  and  a  Pst  I  site  in  the  3’  amplimer 

(primer  1:  5'AGAATTGGGGGATCCTAAATTGATAGAAGT  3';  and  primer  2:  5'GCTATTT AATTTCTGCAGTT CATACAT GTT  3'). 

Total  genomic  DNA  of  ST/24,  ST/24.1C,  ST/24.2C,  as  well  as  ST/B12  as  a  control,  was 
amplified  using  these  primers.  100  p.1  of  reaction  mixture  contained  10  mM  Tris-HCI,  pH  8.3, 
50  mM  KCI,  1.5  mM  MgCI2,  0.01%  gelatine,  200  mM  of  each  deoxynucleotide  triphosphate 
(dNTP),  10  pM  of  each  primer,  2.5  U  of  Taq  polymerase  and  1  ng  of  high  molecular  weight 
DNA.  Samples  were  subjected  to  45  amplification  cycles,  consisting  of  a  denaturing  step  at 
94°C  for  90  seconds,  a  primer-annealing  step  at  50°C  for  90  seconds,  and  a  primer  extension 
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step  at  72°C  fcr  135  seconds.  Amplified  envelope  fragments  were  purified,  cleaved  with 
BamHI  and  Pstl,  and  subsequently  cloned  into  M13.  Ten  individual  M13  clones  per  amplified 
DNA  preparation  were  then  isolated,  and  each  clone  was  sequenced  in  the  region,  which 
corresponded  to  the  N-terminus  of  the  transmembrane  envelope  domain.  An  alignment  of 
these  sequences  is  depicted  in  Figure  5. 

All  10  M13  clones  derived  from  ST/B12  contained  sequences  identical  to  JSP4-27, 
which  indicated  that  the  PCR  amplification  procedure  was  reliable  and  did  not  cause  frequent 
misincorporations  of  nucleotides  in  this  particular  DNA  template.  Sequence  comparison  of 
amplified  fragments  from  cell  line  ST/24  demonstrated  no  differences  among  the  individual  Ml 3 
clones,  but  revealed  4  nucleotide  point  mutations  between  these  ST/24  sequences  and  the 
JSP4-27  reference  sequence,  in  fact,  all  ST/24  derived  viruses,  including  the  cytopathic  and 
fusogenic  ones,  exhibited  these  same  four  nucleotide  sequence  differences,  as  well  as  the 
threonine  and  alanine  substitutions  previously  identified  in  the  envelope  fusion  region  of 
JSP4-27.  The  results  thus  confirmed  that  the  observed  fusion  sequence  mutations  were 
representative  of  all  HIV-2/ST  strains,  regardless  of  their  origin  and  biological  phenotype. 
Interestingly,  eight  of  nine  Ml 3  clones  representing  ST/24.1C  and  five  of  ten  clones 
representing  ST/24.2C  contained  additional  point  mutations,  which  predicted  three  amino  acid 
sequence  changes  with  respect  to  the  ST/24  sequence  (Figure  6).  The  presence  of  these 
mutations  within  the  amplified  material  identifies  the  ST/24.1  C  and  ST/24.2C  viral  strains  as 
mixtures  which  comprise  the  parental  ST/24  virus  as  well  as  additional  genotypic  variants. 
These  newly-generated,  genotypical  variants  must  be  responsible  for  the  phenotypical  changes 
seen  in  the  ST/24.1  C  and  ST/24.2C  cultures.  However,  the  genetic  changes  which  are  of 
biologically  significance  do  not  appear  to  occur  in  the  envelope  fusion  domain. 

While  these  studies  were  in  progress,  the  biological  properties  of  the  HIV-2/ST  envelope 
gene  products  were  also  analyzed  in  a  eukaryotic  expression  system  (42).  Vaccinia  virus 
expressed  JSP4-27  envelope  glycoproteins  were  compared  to  that  of  prototype  HIV-1  and 
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HIV-2  viruses.  While  the  processing,  expression,  and  transport  to  the  cell  surface  appeared 
to  be  unaltered,  vaccinia  virus-produced  JSP4-27  envelope  glycoproteins  failed  to  form  syncytia 
with  CD4-bearing  Hela  cells.  Since  no  other  HIV-2  proteins  were  produced  in  this  system, 
these  results  strongly  suggested  that  the  JSP4-27  envelope  glycoproteins  were  primarily 
responsible  for  its  non-fusogenic  and  non-cytopathic  phenotype  (42).  Based  on  these  results 
as  well  as  the  sequence  data,  we  conclude  that  the  non-fusogenic  and  attenuated  properties 
of  HIV-2/ST  are  indeed  a  function  of  its  envelope  gene  products,  although,  the  causative 
genetic  defect  appears  not  to  involve  mutations  within  the  envelope  fusion  domain. 

There  are  several  mechanisms  other  than  a  direct  alteration  of  the  fusion  sequence  that 
could  result  in  the  biological  changes  observed  in  HIV-2/ST.  These  include  mutations  that 
affect  envelope/CD4  interactions,  mutations  that  cause  differences  in  envelope  glycosylation, 
mutations  that  require  additional  cell  surface  molecules  to  facilitate  virus-cell  fusion,  as  well  as 
mutations  that  reduce  the  stability  of  envelope  glycoprotein  complexes  on  the  cell  surface.  In 
fact,  several  naturally-occurring  as  well  as  genetically-engineered  immunodeficiency  viruses 
have  been  described,  that  are  altered  in  their  fusogenic  or  cytopathic  properties  because  of 
one  of  these  mechanisms.  For  example,  possible  differences  in  the  binding  affinity  of  HIV  and 
SIV  envelope  glycoproteins  to  the  CD4  receptor  have  been  suggested  by  the  finding  that  25- 
fold  more  soluble  CD4  is  necessary  to  block  infectivity  of  prototype  HIV-2  viruses  compared 
to  HIV-1  (10).  It  is  possible  that  the  HIV-2/ST  envelope  glycoprotein  binds  the  CD4  molecule 
with  an  even  lower  affinity,  which  would  be  expected  to  influence  subsequent  steps  of  viral 
entry,  including  membrane  fusion  and  penetration.  Another  mechanism  known  to  cause 
attenuation  of  virulence  in  naturally-occurring  retroviruses  involves  differences  in  post- 
translational  modifications  of  envelope  glycoprote.ns.  Mullins  and  coworkers  showed  that  the 
pathogenic  determinants  of  an  immunodeficiency-causing  FeLV-FAIDS  virus  were  dependent 
on  the  processing  of  particular  envelope  oligosaccharides  (44).  Since  HIV-2/ST  differs  in 
number  and  distribution  of  its  potential  N-linked  envelope  glycosylation  sites  from  other 
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cytopathic  HIV-2’s,  and  since  size  differences  between  the  exterior  envelope  glycoproteins  of 
fusogenic  and  non-fusogenic  HIV-2/ST  strains  have  been  observed  (J.A.  Hoxie,  personal 
communication),  a  biologically-significant  change  in  the  sugar  composition  of  the  HIV-2/ST 
envelope  cannot  be  excluded.  Finally,  a  requirement  of  accessory  molecules  for  virus-cell 
fusion  represents  still  another  potential  mechanism  to  influence  retroviral  cytopathicity.  Studies 
involving  SIV.MC  recently  revealed  that  this  virus  has  a  restricted  host-cell  range  which 
comprises  only  a  subset  of  CD4+  T-cell  lines  (29,33).  While  highly  infectious  and  cytopathic 
for  HUT78  and  H9  cells,  SIVMAC  does  not  fuse  with  CD4-bearing  SupTI  cells.  Moreover, 
SIVMAC  infects  SupTI  cells  only  with  considerable  delay.  It  is  thus  conceivable,  that  SIVMAC 
requires  a  surface  molecule(s)  in  addition  to  CD4  to  establish  a  productive  infection  in  certain 
human  T-cell  lines.  Since  its  infection  kinetics  and  lack  of  cytopathic  effect  in  SupTI  cells  very 
much  resemble  those  of  HIV-2/ST,  it  is  not  unreasonable  to  speculate  that  HIV-2/ST  might 
similarly  require  an  additional  cell  surface  molecule(s)  for  efficient  cell  fusion  or  penetration. 

The  availability  of  cytopathic  variants  of  HIV-2/ST  will  be  instrumental  for  future 
experiments  designed  to  define  the  exact  molecular  determinants  involved  in  HIV-2/ST 
attenuation.  Molecular  clones  representing  the  fusogenic  and  cytopathic  HIV-2/ST  strains  are 
expected  to  exhibit  much  less  genetic  divergence  with  respect  to  JSP4-27  than  do  unrelated 
HIV-2  viruses,  like  HIV-2/ROD  or  HIV-2/ISY.  Therefore,  a  comparative  sequence  analysis  is 
more  likely  to  identify  biologically-important  differences,  and  the  construction  of  chimeras 
between  attenuated  and  cytopathic  clones  will  be  greatly  facilitated.  The  fact  that  cytopathic 
and  fusogenic  ST/24  mutants  evolved  by  cell  free  passage  on  two  independent  occasions 
indicates  the  presence  of  strong  selective  pressures  for  cytopathic  and  fusogenic  viruses  in 
vitro.  It  is  possible  that  similar  pressures  are  also  present  in  vivo  which  may  favor  the 
emergence  of  more  virulent  strains  in  certain  HIV  infected  individuals  over  time  (2,9,50). 
Identification  and  molecular  characterization  of  a  novel  subgroup  of  SIV,„u  Viruses 

Wild  caught  African  green  monkeys  (AGM)  from  East  and  West  Africa  are  infected  with 


10 


SIV  viruses  at  a  high  prevalence  rate  (30-60%)  in  contrast  to  Asian  macaques  which  are  not 
naturally  infected  (14,31).  Because  of  this,  it  has  been  speculated  that  AGMs  could  harbor 
viruses  that  served  as  the  progenitor  of  human  immunodeficiency  viruses  acquired  through 
cross-species  transmission.  Recent  sequence  analyses  of  a  sooty  mangabey  virus  has  shown 
it  to  be  most  closely  related  to  HIV-2  (27).  To  date,  isolation  and/or  molecular  characterization 
of  SIVAGM  from  West  Africa  has  not  been  reported.  Of  additional  interest  is  the  fact  that  wild 
caught  AGMs,  unlike  experimentally  infected  rhesus  macaques,  do  not  appear  to  develop 
disease  as  a  result  of  SIV  infection.  Because  a  molecular  analysis  of  SIVAGM  from  West  Africa 
could  provide  important  information  relevant  to  the  evolutionary  relationships  of  all  HIV  and  SIV 
viruses,  and  because  the  biologic  properties  of  these  viruses  are  likely  to  be  enlightening,  we 
developed  a  PCR-based  approach  for  cloning  these  viruses  directly  from  uncultured  AGM 
peripheral  blood  lymphocytes.  Figures  7  and  8  illustrated  the  "nested"  PCR  technique  that  we 
developed  which  accomplishes  this.  Using  two  sequential  30  cycle  rounds  of  amplification,  first 
with  an  outer  primer  set  and  then  by  an  inner  set  containing  cloning  sites  for  the  Ml 3 
polylinker,  we  successfully  cloned  SIVAGM  from  four  out  of  four  seropositive  AGMs  and  from 
zero  of  two  seronegative  animals.  Figures  9,  10  and  11  show  the  nucleotide  sequences  of 
the  amplified  regions  of  the  polymerase  gene  and  their  relationship  to  SI  Vs  from  other  simian 
species  and  subspecies.  These  results  are  important  for  the  following  reasons:  (i)  they 
demonstrate  for  the  first  time  the  utility  of  the  generic  (universal)  "nested"  PCR  approach  for 
detecting  and  cloning  novel  groups  of  viruses  whose  sequence  divergence  from  known  viruses 
precludes  conventional  hybridization  and  cloning  approaches;  (ii)  they  identify  the  SIV  strain 
infecting  feral  West  African  AGMs  (sabeus)  and  show  that  this  virus  group  is  not  closely 
related  to  HIV-1  or  HIV-2;  (iii)  they  provide  a  rapid  method  for  obtaining  molecular  probes  of 
novel  viruses  for  full-length  lambda  phage  cloning  and  other  kinds  of  nucleic  acid  analyses. 
Molecular  Cloning  and  Analysis  of  Replication  Competent  HIV-1  Proviruses  from  Uncultured 
Human  Brain  in  AIDS  Dementia  Complex 


11 


All  previous  full-length  replication-competent  proviral  clones  of  HIV-1  have  been  obtained 
from  cell  cultures  of  amplified  virus,  due  to  the  low  abundance  of  viral  DNA  in  chronically 
infected  individuals.  Because  such  virus  strains  may  be  altered  be  a  result  of  in  vitro 
propagation,  we  sought  to  molecularly  clone  and  genetically  and  biologically  characterize  full- 
length  replication  competent  HIV-1  proviruses  directly  from  uncultured  human  brain  tissue  of 
a  patient  with  AIDS  Dementia  Complex  (ADC).  Objectives  were  (i)  to  generate  replication 
competent  proviral  clones  from  uncultured  human  tissue  thereby  allowing  an  analysis  of 
genome  organization  and  gene  structure-function  relationships  of  virus  not  subjected  to  in  vitro 
selection  pressures,  (ii)  to  generate  transfection-derived  genetically  defined  HIV-1  strains 
without  interim  ceil  culture  for  analysis  of  virus  biological  properties,  replicative  DNA 
intermediate  forms,  integration  status,  and  the  possible  existence  of  defective  and/or  helper 
forms,  (iii)  to  compare  and  characterize  genotypic  variation  of  proviral  clones  obtained  by 
direct  lambda  phage  cloning  as  compared  to  clones  obtained  by  PCR  amplification.  For  these 
studies,  high  molecular  weight  DNA  from  a  brain  specimen  obtained  at  necropsy  was  subjected 
to  lambda  phage  cloning  and  10  HIV-1  proviral  clones  out  of  8  x  106  recombinants  were 
obtained  (Figure  12).  These  proviral  clones  contained  integrated  and  unintegrated  forms,  forms 
with  one  or  two  LTRs,  and  genomes  with  large  deletions  and  self-integrated,  reversed  LTR 
sequences.  Eight  HIV-1  proviral  clones  (in  lambda)  and  11  PCR  derived  clones  from  the 
same  brain  were  sequenced  in  a  525  bp  hypervariable  envelope  region.  All  19  clones  were 
highly  related  yet  distinct  with  nucleotide  variation  between  0.1  and  0.3%.  Four  proviral  clones 
in  lambda  were  determined  to  be  full-length  by  restriction  mapping  and  two  of  these  were 
shown  to  be  transfection-competent  in  Cos-1  cells  and  replication  competent  in  human  T  cells 
and  monocytes  after  cell  free  passage.  We  have  found  in  preliminary  studies  that  these 
molecularly  derived  strains  of  HIV-1  from  uncultured  human  brain  replicate  to  higher  levels  in 
monocyte-macrophages  than  other  putative  monocyte  tropic  viruses  and  therefore  they 
represent  important  reagents  for  characterizing  monocyte  tropic  viral  determinants  at  a 
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molecular  level.  The  genomic  organization,  structure-function  characteristics  of  specific  gene 
products,  and  the  biological  properties  of  these  HIV-1  proviral  clones  are  under  study. 
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Figure  1.  A:  Construction  of  a  replication-competent  HIV-2/ST  plasmid  clone  .  The  construction 
scheme  used  to  generate  a  transfection-competent  plasmid  subclone  of  HIV-2/ST  is  outlined.  XJSP4-27  (34) 
was  partially  cleaved  with  Xba  I  to  remove  phage  arms  (double  lines)  and  flanking  cellular  sequences 
(hatched  lines),  and  the  resulting  14  Kb  provirus-containing  fragment  was  subsequently  subcloned  into  pSP65. 
B:  Biological  comparison  of  HIV-2/ST  and  HIV-2/ROD  derived,  genetlcally-pure  viral  strains.  CEMx174 
cultures,  productively-infected  (>90%)  with  JSP4-27  (HIV-2/ST)  and  SL1  (HIV-2/ROD)  respectively,  were 
examined  in  syncytia  formation  assays  and  by  Western  blot  analysis.  The  left  side  panels  depict  the  complete 
lack  of  syncytium  formation  upon  co-cultivation  of  JSP4-27-producing  CEMx174  cells  with  uninfected  CEM 
cells,  while  numerous  and  large  syncytia  are  generated  upon  co-cultivation  of  the  same  uninfected  CEM  cells 
with  SL1 -infected  CEMx174  cells  (identical  results  were  obtained  with  H9,  SupTl  and  CEMx174  cells). 
Syncytia  formation  was  monitored  18  hours  after  cocultivation.  The  right  side  panel  depicts  the  Western  blot 
profile  of  cell-free  virions  derived  from  these  same  transfection-derived  cultures  and  demonstrates  differences 
in  the  size  of  their  transmembrane  envelope  glycoproteins. 
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.  R< - >US.  200  .... 

GCTCCACCCTTGCTTGCTTAAAAGACCTCTTAATAAAGCTSCCAGTTAGAAGCAAGTTAAGTGTGTGCTCCCATCTCTCCTAGTCGCCGCCTGGTCATTCGGTGTTCATCTAAACTAACA 
.  .  .  .  .  US  <—  t  PBS  ..... 

AGACCCTGG'TCrGrTAGGACCCTTTCrCCrrTGGGAA  ACCAAGGCAGGA  AAATCCCrACCAGGTTGGCGCCCGAACAGGGACTrGAACAACACTGAGAAGCCTTGGAACACGGCTGAGTC 
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aaggcagtaagggccgcaggaacaaaccacgacggagtcctcctagaaaaccgcagccccacgtaccaagcgccgcctgtggagcgggagtcaaagaggcctccggctgaaggtaagtgc 

S00  ........  .  600 

CTACACCAAATACAGTAGCCAGAAGCGCtTGTTATCCTACCTTTAGACGGGTAGAAGATTGTGGGAGATGCGCGCCAGAAACTCCGTCTTCAGACGGAAAAAAGCACACCAATTACAAAA 

gag  >  Me c Gi yA 1 aAr gA*nS«rVa i LeuArgGlyLysLysAl aAspGl uLeuGl uLy 

700 

GATTAGGTTACGGCCCGGCGGAAAGAAAAAATATAGGCTAAAACATATTGTGTGGGCAGCG AATGAATTGGACACA TTCGGATTGGCAGACACCCTGTTGGAGTCAAAAGAGGGTTGCCA 
sIleArgL*uArgProGlyGlyLysLysLysTyrArgLeuLysHisrleValTrpAlaAlaAsnCluLeuAspArgPheGIyLeuAlaGluS«rLeuI.auC.luS«rI.ysGiuGlyCy*Gl 
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AAAAATTCTTACAGTTTTAGArCCArTAGrACCCACAGGCTCAGAAAATTTAAAAAGCCrTTTTAArACTGTCrcCGrCArrTGCTGTATACACGCAGAAGACAAACCGAAACATACTCA 
nLys I leLeuThrValLeuAapProLeuVal ProThrGly SerGl uAsnLeu  Ly s  Se  rLauPheAsnThrValCysVal I leTrpCy s IleHi sAlaGluGluLy sAlaLy s AapThrGl 
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agaagcaaaacaaaaggtacagagacatctagtggcagaaacaaaaactacagaaaaaatgccaagtacaagtagaccaacagcaccacctagcccgaaccgaggaaacttccccgtaca 

uGIuAlaLy  sGlnLy aValGlnArgHis LeuValAlaGluThrLy sThrThrCl uLy sMat ProSe rThrSarArgProThrAlaProProSarGlyAsnGlyGlyAsnPheProValGl 

1000  ........ 

ACAAGTGGCCGGCAACTATACCCATGTGCCACTAAGTCCCCGAACCCTA AATGCTTCGGTAAAACTAGTAGAGCAAAAGAAGTTCGGGGCACAAGTAGTGCCAGGATTTCAGCCACTCTC 
nGlnValAlaGlyAsnTyrThrHlsValProLeuSerProArgThrLeuAsnAlaTrpValLysLeuValGluGluLysLysPheGlyAlaGluValValProGlyPheGlnAlaI.«uSa 

1100  .........  1200 

AGAAGGCTGCACGCCCTATGATATTAATCAAATGCTTAAT7GTGTGGGCGACCATC AAGCAGCTATGCAAATAATCAGGGAAATTATTAATGAAGAAGCAGCAGATTGGCACGCACAACA 
'rGluGiyCysThcProryrAspIIeAsnGlnMetLeuAsnCysValGlyAspHisClr.AiaAlaMetGlnl lelleArgGluIlellaAanGluGluAlaAlaAsptrpAspAlaClnMi 

1300 

CCCAATACCAGGCCCCrTACCAGCGCCGCAGCTCAGGGACCCAAGGGGATCTGACATAGCAGGGACAACAAGCACAGTAGAACAGCACATCCAGTGGATGTTTAGGCCACAAAATCCTGT 
sProileProGlyProLauProAlaGlyGinLeuArgGl  uProArgGlySerAspIleAiaGlyThrThrSerThrValGluGluGlnlleGlnTrpMatPhaArgProClnAsnProVa 

1400  .... 

ACCAGTAGGAAGCATCTATAGAAGATGGATCCAGATAGGGCTACAGAAGTGCGTCAGGATGTACAACCCAACCAACATCCTACACATAAAACAGGGACCAAAGGAGCCArTCCACAGTTA 
IProValGlySar I laTy r ArgArgTrpI leGlnl leGly LeuGlnLy sCysValArgMetTyrAsnProThrAsnlleLeuAspI leLysGlnGlyProLysGl uPr oPheG InSarTy 

1500  ...... 

tgtagatagattctacaagagcttgaggccagaacaaacacatccagcagtaaaaaattggatgacccaaacactgctagtgcacaatgccaacccagactctaagttagtactaaaagg 
rVal AspArgPheTyrLysSerLeuArgAlaGluGlnrhrAspProAlaVal LysAsnTrpMetthrGlnThr LcuLauValGlnAsnAlaAsnProAspCysLy s  LeuVal LeuLyaGl 

1600  ........ 

AC7AGGGA7AAATCC7ACC77AGAACAAATGC7AACCCCC7G7CAGGCGG7AGG7GGACCAGGCCAGAAAGCCAGA77AA7GGCAGAAGCCt7AAACGAGGCCA7GGCACCAGCCCCCAT 
y  LeuGiy  I  la AsnProThrLeuGi uGluMe  c  Le  uTft r A iaCys Gl ft GlyVa lei yd yProGiyG InLysAlaArgLeuMerAlaGi uAlaLeuLysGluAlaMetAlaPr  oAlaProIl 

17C0  .........  1800 

CCCA777GCAGCAGCCCAACAGAGAAGGAC AATTAAG7GC7GG AA77GCGCAAAGGAAGGGCAC7CGGCAAGACAA7GCCGAGCACCTAGAAGACAAGGC7CCTGG AAA7GTCGCAAGGC 

pol  >  LysThrArgLeuLeuGl uMetTrpGlnGly 

eProPhaAlaAlaAlaGlnGlnArgArgThr IleLysCysTrpAsnCysGlyLysGluGlyHiaSarAlaArgGlnCysArgAlaProArgArgGlnGlyCystrpLysCysClyLyaAl 

19C0 

AGGACACATCA7GGCAAAA7GCCCAGAAAGACAGGCGGG7TT777AGGC77CGGCCCA7GCGGAAAGAAGCCCCGCAA7T7CCC7G7CGCCCAAA7CCCGCAGGGGC7GACACCAACAGC 
Arg7hrHisHlsGlyLysMet?roArgLys7hrGlyGlyPhePheArgValGlyProMetdyLy sGluAlaProGlnPheProCysGlyProAsnProAlaGlyAlaAspThrAsnSar 
aGlyHlsI leMetAlaLysCysProGluArgGlnAlaGlyPheLeuGlyLeuGlyProTrpGlyLysLysProArgAsnPheProValAlaGinllaProGinGlyLeuThrProThrAl 

200C  .... 

ACCCCCGA7AGACCCAGTAGAGGACC7ACTAGAGAAGTACATGCAGCAAGGCAAAAGGCAGACAGAGCACACAGAGACGCCA7ACAAAGAAGrGACAGAGGACT7CC7GCAGCTCGACAA 
ThrProAspArgProSarArgGlyProThrArgGluValHisAlaAlaArgGiuLysAiaGluArgAlaGluArgGluAlal  leGlnArgSerAspArqGlyLeuProAlaAlaArgGlu 
aProProIlaAspProValGluAspLeuLeuGluLysTyrMecGlnGlnGlyLysArgGlnArgGLuGlnArqGluArgProTyrLysGlaValThrGiuAspPheLauGlnLeuGluLy 

2100  ...... 

ACAAGAGACACCATGCAGAGAGACGACAGAGGACTTGCTGCACCTCAATTCTC7CTTTGGAAAAGACCAGTAGTCACAGCACATGT7GAGGGCCAGCCAGTACAAGTTTTGCTACACACA 
Thr  ArgAapThrMatGlnArgAspAspArgGly  LeuAl  aA  laPr  oG  inPhe  Sa  rLeutrpLysArgPr  oval  Vai.ThrAi.afUsValCiuGlyGlnPro'/dlGluVa.l  Leu  LeuAspThr 
sGlnGluThrProCysArgGluThrThrGluAspLeuLeuHisLauAsnSer LauPheGlyLy sAspGln* • • 

2200  .  .  .  .  . 

GGCGCTGACGACTCAATAGTAGCAGGCGTAGAGTTAGGGAGCAATTATAGTCCAAAGATAGTAGGGGGAATACGGGGATTCATAAATACCAAACAATATAAAAATGTAGAAATAAGAGTA 
GlyAlaAapAspSarlleVaiAlaGly ValGluLeuGlySerAsnTyrSarProLysIleValGlyGly IleGlyGlyPhel  leAsnThrLysGiuTyrLysAsnValGluIleArgVal 

2300  .........  2400 

TTAAATAAAAGAG7AAGAGCCACCA7AA7GACAGGTGA7ACCCCAATCAACA77?77GCCAGAAACA7TC7GACAGCC77AGGCATG7CATTAAA7C7ACCAGTCGCCAACA7AGAACCA 
L«uAsnLy*ArgValArgAlaThrrleMecThrGlyAspThrProIleAsnIlePheGlyArgAanIleLeuThrAlaLeuGiyMetSarLeuA*nLauProValAlaLy*I laGluPro 

2500 

A7AAAAA7AATGC7GAAGCCAGGAAAGGA7GGACCAAAAC7GAGACAA7CGCCC77AACAAAACAAAAAA7AGAGGCAC7AAAAGAGA7C7G7GAGAAAA7CCAAAGAGAGGGCCAGC7A 
rieLysIleMetLeuLyaProGlyLysAspGlyProLysLauArgGlnTrpProLeuThrLysGluLysIleGiuAlaLauLysGlur laCyaGluLysMetCluArgGluGlyGlnLeu 
•  •  ♦  •  ■  .  .  2600  .... 
GAGGAGGCACC7CCAAC7AA7CC77A7AA7ACCCCCACA777GCAATCAAGAAAAAGGACAAAAACAAATGGAGAA7GC7AATAGA7777AGAGAAC7AAACAAGGTAAC7CAACAC77C 
GluGiuAlaProProThrAsnProTyrAsnThrProThrPheAlallaLysLysLysAspLysAsnLysTrpArg.MetLauT  1  eAspPheArgC  1  u  LeuAs  n  Ly  sVa  IThrG  In  AapPhe 


Figure  2:  Complete  nucleotide  sequence  of  the  HIV-2/ST  proviral  genome.  9,672  bp  ol  nucleotide  sequence  are  depicted  along 
with  the  deduced  amino  acid  sequences  of  the  corresponding  viral  proteins.  The  sequence  starts  at  the  5’  cap  site  and  ends  with  the 
3’  polyadenylation  site  of  the  viral  RNA.  The  primer  binding  site  (complementary  to  the  tRNA,„),  the  poiypurin  tract  (PPT),  as  well  as 
short  inverted  repeats  which  flank  the  LTRs  are  underlined.  Core  enhancer  sequences  (E),  SP1  binding  sites  (Spl),  the  lata  box  (TATAA) 
and  the  polyadenylation  site  signal  (AATAAA)  are  shown.  The  U3/R  and  R/U5  boundaries,  as  well  as  the  splice  donor  (SD)  and  splice 
acceptor  (SA)  sites,  have  been  determined  in  analogy  with  HIV-2/ROD  (23).  The  vpr  open  reading  frame  contains  a  premature  in-frame 
TAA  stop  codon  at  position  5777,  which  is  indicated  by  three  asterisks.  Sequence  analysis  was  performed  by  the  chemical  degradation 
melhod  according  to  Maxam  and  Gilbert  (41)  as  well  as  by  the  dideoxynudeotide-chain  termination  method  according  to  Sanger  (48). 
The  nucleotide  sequence  of  JSP4-27  has  been  submitted  lo  the  AIDS  Sequence  Data  Base,  Los  Alamos  National  Laboratories,  as  well 
as  to  Genebank  and  EMBL  libraries. 
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FIGURE  2  (cont) 


2700  ...... 

acagaaatccagttaggaattccacacccagcaggactagccaagaacaaacgaattactgtcctagatgtaggggatgcttacttttccataccactacatgagcattttagacagtat 

7hrGluUaGlnLeuGlyIleProHisProAl4GlyleuAlaLysLysLysArgI le7hrVai LeuAspValGlyAspAlaTyrPheSerl leProLeuHisCluAtpPheArgGlnTyr 

2800  .  ...... 

ACTGCATTTACTCTACCATCAATAAACAATGCTGAACCAGGAA  AAAGATACATATATAAAGTCTCACCACAGGG  ATGGAAGGGATCACCAGCAATTTTTCACTACAC  AATGAGGCACCTC 
ThrAlaPhathrLauProSarllaAsnAsnAlaGluProGlyLy s Arg7y r I leTy r ly s Va 1 Se r P roClnGly7rpLysGlySerProAla I lePheGlnTy r7hrMe tArgGlnVal 

2900  .....  2900  .  .  .  3000 

TTAGAACCATTCAGAAAAGCAAACCCGGATATCATTCTCATTCACTACATGGATGATATCTTGATAGCCACCGACAGGACACATTTAGAACATGACAGAGTGCTTCTGCAGCTAAACGAA 
LeuCluProPheArgLysAlaAsnProAsprielleLauI leGlnTy rMe  cAspAspI leLeuIleAlaSerAspArgr*!rA*pL0uGluHi«A*pArgValValLeuG:n:.euLy3Glu 

3100 

C7TCTAAA7GGCCTGGGA7TT7CCACCCCAGA7GAGAAGTTCCAA  AA  AGACCC7CC  A7ACCAA7GGA7GGGC7A7GAAC7G7GGCCAAC7AAA7GG  AAGC7GCAAAGAA7ACAAT7GCCC 
LeuLauAsnGly LeuGlyPheSarThrProAspGluLysPheGlnLysAspProProTyrClnrrpMeLGlyTyrGluLeuTrpProThrLy*'"rpLy sLeuGlnArgl leGinLeuPro 

3200  .... 

CAAAAGGAAGTATGGACAGTCAATGACATCCAAAAACTGGTGGGTGTCCTAA  ATTGCGCAGCACAAATCTACCCAGGGATAAAGACCAGAAACTTATGTAGGTTAATCAGAGGAAA  k  ATG 
GlnLy sGluVal7rpThrValAsnAspl leGinLysLeuValGlyVal LeuAsn7rpAla AlaGlnl leTyrProGly IiaLysThrArgAsnLeuCy * ArgLeu  I  leArgGlyLy  *»iet 

3300  ...... 

ACACTCACAGAAGAGGTACAGTGGACAGAATTAGCAGAAGCGGAACTACAAGAAAACAAAATCATCTTAAGCCAGGAACAAGAAGGATGCTATTACCAAGAGGAAAAGGAGCTAGAAGCA 

ThrLeuThrGluGluVaiGlnTrpThrGluLauAlaGluAlaGluLeuGluGluAsnLysIlelleLauSarGlnGluGlnGluGlyCysTyrTyrClnGluGluLysGluLauCluAla 

3400  .  .  .  .  . 

ACAGTCCAAAAAGATCAAGACAATCAGTCGACATATAAGATACACCAGCCAGGAAAAATTCTAAAAGTAGGAAAATATGCAAAGGTAAAAAATACCCACACCAACGGAG7CAGACTCCTA 

ThrValGlnLysAspGlnAspAsnGinTrpThrTyrLysIlaHisGlnGlyGlyLysIleLeuLysValGlyLysTyrAlaLyaValLysAsnThrHijThrAsnGlyValArgLauLau 

3500  ....  .....  3600 

GC  AC  AAG7AG77CAAAA  AA7AGGAAAAGAAGCAC7AGTCA777GGGGACG  A  A7ACC  A  A  AA777C  AC  CTACCAGTAGAAAG  ACA7ACC7GGGAACAG7GGTGGG  AT  AAC7AC7GGCAAG7G 
AlaGinValVaiGlnLy s IleGlyLy sGluAiaLeuVallieTr pGiyArgl leProlysPheHisLeuProVaiGl  uArgAspThrTrpGluGlnTrpTrpAspAsnTy rTrpGlnVal 

3700 

ACA7GGA7CCCAGAC7GGGAC77CA7A7C7ACCCCGCCAC7GG7CAG  AT7AG7A777AACC7GG7GAAAGA7CCCATAC7AGGCCCAGAAACC77C7ACACAGA7GGA7CC7GCAA7AAG 
7hr7rpIleProAsp7rpA3pPhaIleS«rThrProProLeuValArgLeuVai?heAsnLeuValLyaAspProIlaLaaClyAlaClu7hrPhe7yr7hrAspGlySarCysA*nLy* 

38C0  .... 

CAA7CAAGAGAAGGAAAAGCAGGA7ACA7AACAGA7AGAGGAAGAGACAAGG7GAGGC7AT7ACAGCAAACCACCAA7CAGCAAGCAGAAT7AGAAGCC777GCGA7GGCAG7AACAGAC 

GlnSerArgGluGlylysAlaGlyTyrIle7hrAspArgGlyArgAspLysValArgLeuLeuGluGln7hr7hrAsnGlnGlnAiaGluLeuGluAla?neAlaMetAlaVal7hrA*p 

3900  ...... 

7CAGG7CCAAAGGCCAACA77A7AG7AGACTCACAA7A7G7AA7GGGAA7AG7AGCAGGCCAACCA  ACAGAG7CAGAGAG7AAAA7AG7AAA7CAAA7CATAGAAGAAA7GA7AAAAAAG 
SerGlyProLysAlaAsnllelleValAspSerGlnTyrValMetGlylleValAiaGlyGlnProThrGluSerGluSerLysIleValAsnGinlie  :  leGluGlu.Met  IieLyaLya 

4000  .  .  ...... 

GAAGCAATCTATGTTGCATGGGTCCCAGCCCATAAAGGCATAGGAGGAAATCAGGAGCTAGATCACTTAGTAAGTCACGGCATCAGACAAGTATTATTCCTAGAGAAAATAGAACCCCCT 

GluAlaIla7yrValAlaTrpValProAlaHisLysGlyIlaGlyGlyAsnGlnGluValAspHisL«uValSerClnGlyIlaArgGlnValLeuPheleuGlaLy*Il«GiuProAla 

4100  4200 

CAGGAGGAACA7GAAAAA7A7CATAGCAATG7AAAAGAAC7ATCCCA7AAA77?GGACTGCCCAAATTAG7CGCAAGACAAA7AG7AAACACATGCACCCAA7G7CAGCAGAAAGGGGAG 
GlnGluGluHisGluLysTyrHisSerAsnValLysGiuLeuSerHisLysPheCiyLeuProLysLeuValAlaArgGlnl  leValAsnThrCysThrGi nCysGl nGInLysGIyGlu 

4300 

GCTATACATGGGCAAGTAAATCC  AGAA77AGGCAC77GGCAAA7GGAC7GCACACACT7AGAAGGAAAAATCA7TA7AG7AGCAG7ACATG77GCAAG7GGA7T7ATACAAGCAGAAG7C 
AlalleHisGlyGlnValAsnAlaCluLeuGlyThrTrpGlnMetAspCysThrHisLeuGluGlylysUellelleValAlaValHiaValAlaSerGlyPhelleGluAlaGluVal 

4400  .... 

A7CCCACAGGAA7CAGGAAGGCAAACGGCAC7C77CC7AC7AAAAC7GGCCAG7AGGTGGCCAA7AACACA7T7GCACACAGACAA7GG7GCCAAC77CAC7TCACAGGAAG7AAACATG 
IleProGlnGluSerGlyArgGlnThrAlaLauPheLeuleuLysLeuAlaSarArgTrpProI leThrHi sLeuHiaThrAspAsnGlyAlaAsnPheThrSerGlnGl uValLy«Mat 

4500  ...... 

G7GGCATGG7GGA7AGG7A7AGAACAA7CC77CGGAG7ACC77ACAA7CC  ACAAAGCCAAGGAG7AGTGGAAGCAA7GAA7CACCACC7AAAAAA7CAGA7AAGCAGAA7TAGAGACCAG 
ValAiaTrpTrpIleGly IieGluGlnSerPheGlyValProTyrAsnProGl nSerGlnGlyValValGluAlaMetAsnHisHisLeuLysAanGlnlleSerArgl laArgGluGln 

4600  .  ...... 

GCAAACACAG7AGAAACAA7AC7AC7AA7GGCAG77CAT7GCA7GAA777TAAAAGGAGGGGAGGAATAGGGGATA7GACCCCAGCAGAAAGAC7AA7CAA7A7GG7CAC7GCAGAACAG 
AlaAsnThrValGl  uThr  I  leValLeuMetAlaVal  HisCy  sMet  AsnPhaLy  s  ArgArgGlyGly  I  leGlyAspMetThrProAlaGl  uArgLeuI  laAsr.MetVaiTh  rAiaGluGln 

4700  ....  .....  4800 

GAAArACAA77CC7CCAAGCAAAAAAT7CAAAA77ACAAAA7777CGGG7CTAT77CAGAGAAGGCAGAGA7CAGC7G7GGAAAGGACC7GGGGAAC7AC7G7GGAAGGGGGACGGAGCA 
Clul  leGlnPheLeuGl  nAlaLysAsnSerLy  sLeuGlnAsnPheArgValTy  rPheArgGluGlyArgAspGlnLeuTrpLy  sGiy  ProGlyGlu  LauLeuTrpLy  aclyAspGlyAla 

4900 

GTCATAGTC  AAGGTAGGGGC7GACA7AAAAA7AA7ACCAAGAACGAAAGC7AAGA7CATCAA  AGAC7A7GGAGCAAGCCAAGAGA7GCATAGCGG77CCAAC77GGAGGG7GCCAGGGAG 
Val IlaValLysValGlyAlaAsp I  la Lyi I  lei leProArgArgLy sAlaLy s I leileLysAspTyrGlyGlyArgGlnGluMetAspSerGlySerAsnLeuGiuGlyAlaArgGlu 

wif  >  MecGluCluGly Ly sArg7rpI leAl a Va IP r oThrTrpArgVa 1 P roG 1 yAr 
5000  .... 

G  ATGGAGAGG7GGCA7AGCC7TA7C  AAGTA7C7AAAA7AC  AGAACAGGAGA7C7AGAG  AACG7G7CC7A7G77CCCCACCA7AAGG7CGGA7GCGCG7GGTGCAC77GCAGCAGGGTAA7 
A*pGlyGluValAla••• 

gMet.CiuArgTrpHisSerI.eu I leLysTyrLeuLysTyrArgThrGiyAspLeuGiuLysValCysTyrVaiProHisHisLy sValGlyTrpAiaTrprrpThrCysSerArgValll 

5100  ...... 

ATTCCCATTAAAAGGAGAAAGTCATCTGCAGATACACGCATACTGGAACCTAACACCAG  AAA  AAGGATGGCTCTCCTCCTATTCAGTAAGACTAACTTGGTATACAGAAAAATTCTGGAC 
ePheProLeuLysGlyCiuSarHisLeuCluIleGlnAlaTyrTrpAsnLeuThrProGluLysGlyTrpL.euSerSerTyrSerValArgLeuThrTrpTyrThrCluLysPheTrpTh 

AGATGTTACCCCAGACTGTGCGGACTCCCTAATACATAGCACTTATTTCTCTTGCTTTACGGCAGGCC  AAGTAAGAAGAGCCATCAGACCCGAAAAGCTATTATCCTGCTGCAACTA  .<  4 
rA*pVal7hrProA*pCy*AlaAspSarI.auIlaHl*SerThrTyrPheSarCy*PheThrAlaGlyGluVaiArgArgAlaIl eArgGlyGluLysLeuLeuSarCysCy sA*nTyt  Pr 

5300  ....  .....  5400 

CCAACCCCA7AAGTACCACG7ACCC7C  ACTCCAG777C7GGCC77AG7GG7AG7GCAAC  A  AA  A7GGC  AGGCCCCAGACAGACAA7ACCACCAGGAAAC  AGTGGCGAAGAAACTATCGGAG 
oGlnAiaMlsLysTyrGlnVaiProSerLeuGlnPheleuAlaLeuValValValGlnGlnAsnGlyArgProGlnArgAspAsnThrThrArgLysGlnTrpArgArgAs  nTy rArgAr 

vp*  >  MetAlaGlyProArgGluTJ'.rl  leP roProG LyAsnSerCiyGluG luThr  IleGlyG 


24 


FIGURE  2  (cont) 


.  .  .  .  .  .  .  .  .  5500 

ACGCCTrCGAGrGGCrAGACAGGACGGTAGAAGCCATAAACAGAGAGGCAGrGAACCACCTGCCCCGAGAGCTTATTTTCCAGGTGTGCCAAAGGTCCTGGACATACTGGCATCATCAAC 
gGlyLauArgValAlaArgGlnAapGlyArgSarHisLyaGlnArgGlySarGl uProProAlaProArgAlaTyrPhaProGlyValAlaLysValLauGluIlaLauAia* •• 
luAlaPhaGluTrpLauAipArgThrValGluAlaIlaA*nArgGluAlaVaXA*nHi*LauProAcgGluLauIlePhaGlnValTrpGlnArgSarTrpArgTyrTrpKi»A*pCluG 
.  .  .  .  .  .  5600  ..... 

AAGGAATGTCAATAAGTTACACAAAGTATAGATATTTGTGCCTAATGCACAAAGCTATGTTCATACATTCTAAGACAGCGTGCACTTGCCTGGGGCGAGGACATGGCCCGGCAGGATGGA 
InGlyMe t Sa r IlaSarTy rThrLysTyrArgty rLauCyiLauMac GlnLyaAlaMacP ha XlaHiaSarLy sArgGlyCyaThrCy a LauCiyCiyGXyKiaGiyProGlyGXyrrpA 

5*700  .  . 

GATCAGGACCTCCCCCTCCTCCCCCTCCAGGTCTAGTCTAArGACrGAACCACCAACAGAGTCTCCCCCGGAGCATACCACCCCACCGAGGGAGCCAGGGGATGAGTGGGTAATAGAAAC 

rgSarGlyProProProProProProProGlyLauVal • *  * 

apr  >  MacThrGluAlaProThrGluSerProProGluAapAxgThrProProArgCluProGlyAapCluTrpValllaGiuTh 
.  .  .  5800  ........ 

cctgagacagataaaataagaagctttaaagcactttgaccctcgcttgctaattactcttggcaactatatctatgctagacatcgacacacccttgaacgccccacagggctcattag 

rLauArgGlu! leLysEndGluAlaLauLy sHisPhaAapP  roArgLauLauIlaThrLauGlyAanTyrllaTy rAlaArgHi*GlyAspThr La  uGluGlyAlaArgGly Laul laAr 

•••  tat  >  MatGluThrProLauLysAlaProGluGlySarLauG 

.  5900  ....  .....  6000 

gatcctacaacgagccctcctcttgcacttcagagcaggatgccgccgctcaaggattgctcagcccaggggacgaaatcctttatcagctataccaacccctagaggcatgccataaca 
gllaLauGlnArgAlaLauLauLauHlaPhaArgAlaGlyCysGlyArgSerAcgliaGiyCinProArgGiyArgAanProLauSarAialiaProrhrProArgGlyMaCArg*  ** 
lySarTyrAsnGluProSarSarCysThrSarGluGlnAspAlaAlaAlaGlnGlyLauValSarPr oGlyAspGluXXaLauTyrGlnLauTyrGlnProLauGluAlaCyaAspAanL 

6100 

aatgttactgtaaaaagtgctgctaccattgccagatgtgttttttaaacaaggggctcgcgatatggtatgaacgaaagggcagaagaagaagaactccgaagaaaactaagcctcatt 

ysCysTyrCysLyaLyaCysCysTyrHisCyjGlnMatCyaPbaLauAsnLyiClyLeuGlyllaTrpTyrGiuArgLyaGlyArgArgArgArgThrProLyaLyaThrLyaAlaHiaS 

rav  >  MatAsnGXuArgAXaGluGXuGXuGXuLarjArgArgLysLauArgLauIla 
.  5D  .  .  .  .  .  .  6200  .... 

cgtc??ctgcatcagacaagtsag?aacatgtgtgg?aggaatcaactatttgttgccagcttgctagc?agtccttgcttaatatattgcgtccaatatg?gac?gttttctatggcgt 

arSarSarAlaSarAspLys 

ArgLeuLauHisGlnThrAsn 

ana  >  MatCyaGlyArgAsnGlnLauPheVaiAlaSerLauLauAXasarAlaCysLauI laTy rCy aValGlnTyrValThrValPhaTyrGlyVa 

6300  ...... 

GCCCGTGTGGAGAAATGCATCCATTCCCCTCTTTTGTGCAACTAAAAATAGAGATACTTGGGGAACCATACAGTGCTTGCCACACAATGATGACTATCAGCAAATAGCTTTAAATGTGAC 

IProValTrpArgAanAlaSerlieProleuPhaCyaAiaThrLyaAsnArgAsprhrTrpGlyThrllaGinCysLauProAapAanAspAspTyrGlnCiullaAlaLauAanValTh 

6400  .  .  .  . 

ACAGGCCrTCGACGCArGGAArAArACAGTAACAGAACAAGCAGTAGAAGATGTCTGGACTCTATTTGAGACATCAATAAAACCATCCCTCAAACTAACACCCTTATGTGTAGCAATGCG 
rGluAlaPheAapAiaTrpAsnAsnThrValThrGluGlnAlaValGluAapValTrpSerLeuPhaGluThrSar IlaLysProCy sValLy*LauThrProLauCy*ValAlaMatAr 

6500  .........  6600 

rrGTAACAGCACAACTGCAAAAAACACAACCTCCACACCAACAACCACCACAACAGCAAACACAACAATAGGAGAGAATTCTTCATGCATACGCACAGACAACTGCACAGGGTTGGGAGA 
gCy*A*nsarThrThrAlaLy lAanThrThrSerrhrProThrThrThrThrThrAlaAsnthrThrllaGlyGluAsnSarSarCy sIleArgThrA*pA*nCy*ThrGlyLauGlyGl 

6700 

agaagagatggtcgactgtcagttcaatatgacaggattagagagggataagaaaaaactatataatgaaacatggtactcaaaagatgtagtctgtgaatcaaatgacaccaagaaaga 

uGluGiuMacValAspCyaGlnPhaAsnMatThrGlyLauGluArgAspLysLy  sLy  s  La  ufyrAs  nGiuThr  TrpTyrSarLy  a  AapValValCy  *GXuSarA«nA*pTh  rLy  a  Ly*Gl 
.  .  .  .  .  .  .  6800  .... 

GAAAACATGTTACATGAACCACTGCAACACATCAGTCATCACACAGTCATGTGACAAGCACTATTGGGATACTATGAGGTTTAGATATTCTGCACCACCGGGTTTTGCCCTGCTAAGATG 
uLyaThrCystyrMatAanHisCyjAanThrSarVaillaThrGluSarCyaAapLysHiaTyrTrpAspThrMacArgPhaArgTy  rCysAlaProProGlyPhaAlaLauLauArgCy 
.  .  .  .  .  6900  ...... 

CAATGATACCAATTATTCAGGCTTTGACCCCAATTCTTCTAAGGTAGTACCTGCTACATGTACAAGGATGATGGAAACGCAAACCTCCACTTGGTTTCCCTTTAATGGCACCAGGCCAGA 
sAsnAspThrAanTyrSarGXyPhaGX  uProAsnCyaSarLyaValValAlaAlaThrCysThrArgMatMatGluThrGlnThrSerThrTrpPhaGlyPhaAanGlyThrArgAiaGl 

7000  ........ 

AAATAGAACATATATCTATTGGCATGGTAGGGATAATAGAACCATCATTACCTTAAACAAGTTTTATAATCTCACCGTACATTGTAAGAGGCCAGGAAACAAGACAGTTGTACCAATAAC 
uAanArgThrtyr  IlaTyrTrpHiaGlyArgAspAanArgThr  IXallaSarLauAanLyaPheTy  cAsnLauThrValHisCysLyaArgProGlyAanLyaThrValValProIlaTh 

.  7100  .........  7200 

ACTCATGTCAGCGTTAGTGTTTCACTCCCAGCCAATCAATAGAACACCCAGGCAAGCATGGTGCTCGTTCAAAGGCGAGTGGAAGGAAGCCATGAAGGACGTGAAGCTAACCCTTGCAAA 
rLauMa  csarGiyLeuValPhaHiaSarGlnProI  XaAanArgArgProArgGXnAlaTrpCy  a  TcpPhaLysGlyGluTrpLy  aGluAlaMat  LyaGluValLyaLauThrLauAlaLy 

7300 

ACATCCCAGGTATAAAGGAACCAACGACACAGAAAAAATTCGTTTTATAGCGCTAGCAGAACGCTCAGACCCAGAAGTGGCATACATGTCGACTAACTGCAGAGCAGAATTTCTCTACTG 
aHlsProArgTyrLysGiyrhrAanAapThrGluLyalleArgPhellaAlaLauGlyGluArgSarAjpProGX  uValAlaTyrMatTrpThrAanCyaArgCXyGluPhaLauTyrCy 

7400  .... 

CAATArGACTTGGTTCCTCAArTGGCTAGAAAACAGAACGAATCAGACACAGCACAATTATGTGCCATGCCATATAAAGCAAATAATTAATACCTGGCACAAGCTAGGGAAAAATGTATA 
aAsnMecThrTrpPhaLauAanTrpValGluAanArgThrAanGXnThrGlnHiaAsnTyrValProCy  aHiallaLy  aGlnl  lallaAanThrTrpHlaLy  aValGlyLyaAanValTy 

7500  ...... 

TTTGCC7CCTACGGAAGGACAGTTAACCTGCAACTCTACAGTGACCAGCATAATTGCTAACATTGACGGAGGAGAGAACCAGACAAATATTACCTTTAGTGCACACGTGCCAGAACTATA 
rLauProProArgCluGlyGlnLauThrCysAanSarThrValThrSar  Ilal  laAlaAmllaAapGlyGXyCluAanGlnThrAanl  laThrPhaSarAlaGluValAlaGluLauTy 

7600  ........ 

CCGATTAGAAtTGGGGCATTATAAATTGATAGAAGTAACACCAATTGGCTTTGCACCTACACCAGTAAAAAGATACTCCTCTGCTCCAGTCAGGAATAAAAGAGGTGTATTCGTGCTAGG 
rArgLauGluLauGlyAapTyrLyaLau I XeGluValThrPr oIXaGly PhaAXaProThrProValLy aArgTy  rSarSarAlaPcoVaiArgAanLyaArgGlyValPhaVaiLauCl 

.  7700  .........  7800 

GTtCTTAGGTtTTCTCACGACAGCACCAGCTCCAATGGGCGCGGCGrCCTTGACGCrGTCGGCrCACrCTCGGACrTTATTGGCCGCCATAGTGCAGCAACAGCAACAGCTGTTGGACGT 

yPhaLauGlyPhaLauThrThrAXaGlyAXaAXaMacGXyAlaAXaSarLauThrLauSarAlaGlnSarArgThrLauLauAlaGXylXaValGXnGXnGlnGlnGXnLauLauAapVa 

......  7900 

gctcaagagacaacaagaaatgttgcgactgaccgtctggggaacaaaaaatctccaggcaagagtcactgctatcgagaaatacttaaaggaccagccgcaactaaattcatggggatg 
lValLy*ArgClnClnGluMatLauArgLauThrValTrpClyThrLy*AinLauGlnAlaArgValThrAiaIlaGluLysTyrLauLy*A*pGlnAlaGlnLauA*nSarTrpGlyCy 
.  .  .  •  .  .  .  8000  .... 

TGCGTCTACACAAGTCTGCCACACTACTCTACCATCGGTAAATGACACCTTAACGCCTGATTGGAACAACATGACATGGCAGGAATGCGACCAACGAATCCGCAACCTAGACGCAAATAT 
•AlaSarArgGlnVaXCyaHiaThrThrval  ProTrpValAanAapThrLauThrProAtpTrpAanAanMacThrTrpGlnGluTrpGluGlnArgllaArgAanLauGluAXaAanti 
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FIGURE  2  (cont) 


8100  ...... 

cagtgaaagtttagaacacgcacaaatccagcaagaaaagaacatgtatcaactacaaaaattaaatarctgggatctttttggcaactggtttcatttaacctcctccatcaaatatat 

eSerduSerLeuduGlnAladnlleGlndnGluLysAsnMatTyrGluLauGlnLyaLauAanSarTrpAapValPheClyAanTrpPheAapLauThrSarTrpIlaLyaTyrll 
.  .  .  8200  ........ 

rCAGrATGCAGTTTATATAGTAGTAGGAArAATAGTTTTAAGAATACTAATATATCTAGTACAAATCTTAAGTACACTTACAAAGGCCTATAGGCCTGTTTTCTCTTCCCCCCCCCCTTA 
eGlnTyrGlyValTyrlleValValGlyl leIleValLeuArgIleValIleTyrValValGlnMet.LeuSerArgLeuArgLysdyTyrArgProValPheSerSerProProAlaTy 

SA  8300  .........  8400 

CTTCCAACAGATCCATATCCACAAGCACCCGGAACACCCACCCACAGAAGAAACACAACAACACGTTGGAAACAGCGTrCCAGACAATTCGTCCCCCTCGCCCATAACATATATACATTT 
rPheClnGlnlleHisIleHiaLyaAapArgCluGlnProAlaArgGluGluThrGluGluAapValGlyAanSarValClyAapAanTrpTrpProTrpProIlaArgTyrllaHiaPh 
tat  >  Sar IleSerThrArgThrGlyAsnSerGlnProduLyaLyaGlnLyaLyiThrLeuduThrAlaLeuGluThr I  leGlydyProGlyArg* •• 
rav  >  ProryrProdnGlyProdyThrAlaSerdnArgArgAanArgArgArgArgTrpLysGlnArgTrpArgdnLeuValAlaLeuAiaAapLya IleTyrThrPhe 

8500 

CCTGATCCGCCAGCTGATTCGCCTCTTGAACAGACTATACAACATCTGCAGCGACTTACTATCCAGGAGCTTCCAGACCCTCCAACTAATCTCCCAGAGTCTTCGGAGAGCATTGACAGC 
aLeuIlaArgGlnLeuIleArgLeuLeuAsnArgLeuTyrAanXleCyiArgAapLeuLauSarArgSerPheClnThrLeuGlnLeuI leSarClnSarLauArgArgAlaLauThrAl 
ProA*pProProAlaA»pSarProLauGluGlnThrIlaGlnHl*LauGlnGlyLauThrllaGlnGluLauProAspProProThrA*nLauProGluSarSarGluSarIlaA*pSar 

8500  .... 

AGTCAGAGACTGGCTGAGATTTAACACACCCTACCTGCAATATCGGGGCGAGTGGATCCAAGAAGCGTTCCGAGCCTTCGCGAGGGCTACGCGAGACACTCTTACAAACGCCTCGAGAGG 
aValArgAspTrpLeuArgPhaAsnThrAlaTyrLauGlnTy rClyGlyGluTrpIlaGlnGluAlaPhaArgAlaPhaAlaArgAlaThrGlyGluThrLauThrAanAlaTrpArgGl 
SarGlnArgLeuAlaGluIle*  ** 

oaf  >  MetGlyAlaSarGlySerLysLy a ArgSerGlu ProSe rArgGlyLeuArgGl uArgLeu Laud nThr Prod yGluA 
8700  ...... 

CTTCTGGGGGACACTGGGACAAATTGGGAGGGGAATACTTGCAGTCCCAAGAAGGATCAGGCAGGGGGCAGAAATCGCCCTCCTGTGAGCGACGGCGGTATCAACAGGGAGAtTTtArGA 
yPhaTrpGlyrhrLauGlyGlnllaGlyArgGly IlaLauAlaValProArgArgllaArgGlnGlyAlaGluIlaAlaLauLau*** 

laSarGlyGlyHlsTrpAipLy  «LauGlyGlyGluTy  rLauGlnSarGlndudy  SarGlyArgdyGlnLyaSarProSerCy  sduGlyArgArgTy  rClnClndyAapPhaMatA 

8800  .  .  .  .  . 

ATACCCCATGGAGAGCCCCAGCAGAACGGGAGAAACGCTCGTACAAGCAACAAAATATGGATGATGTACATTCAGATGATGATGACCTAGTAGGGGTCCCTGTCACACCAAGAGTACCAT 

enrhrProTrpArgAlaProAlaCluGlyGluLysGlySerTyrLysdnGlnAsnMetAepAapValAepSarAspAapAspAapLauValGlyValProValThrProArgValProL 

.  8900  .  .  .  PPT  .  I — >03.  ....  9000 

rXAt:ACAAATCACATATAGGTTGGCAAGAGATATGTCACATTTGATAAAAGAAAAGGGCCGACTGGAACGGCTGTATTACAGTGATAGGAGACG7ACAGTCCTAGACATATACTTAGAAA 
auArqGluMatThrTyrArgLauAlaArgAspMat S«  rKi*LauIl«Ly *GluLyaGly61yLauGluGlyL«uTyrTyrS«rA*pArgArgArgArgValLauAapIlaTyrLauGluL 

9X00 

AGGAAGAGCGAATAATTGGAGACTGGCAGAACTATACTCATGGACCAGGAGTAAGGTATCCAAAGTTCTTTCGCTGCTTAtGGAAGCTACTACCAGTAGATGTCCCACAAGAGGGAGATG 
y aGluGluCly I  la  I laGlyAspTrpGl nAanTy rlhrHl sGlyProGlyValArgTy r ProLy sPhePheGlyTrpLeuTrpLy sLeuVal ProValAapValProGlnCludyAspA 

.  .  .  .  9200  .... 

acactgagactcactgcttagtgcatccagcacaaacaagcaggtttcatgacccccatggagaaacattagtttgcaggtttcaccccacgctagcttttagctacgacgcctttattc 
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GATACCCACAGGAGTTTGGGTACAAGTCAGGCCTCCCAGAGCATCAATCGAAGGCAAGACTGAAAGCAACAGGGATACCGTTTAGCTAAAAACAGGAACACCTATACTTCGTCAGGGCAG 

rgTyrProGluGluPhaGlyTyrLyaS«rGlyLauProCluAapGluTrpLyaAlaArgLauLyaAlaArgGlyIlaProPh«Sar*## 

.  .  E  94  00  .  E  .  Spl  Sol  Spl 

GAAGTAACTAACACAAAACAGCTGAGACTCCAGCCACTTTCCAGAACCGCCTGrTACCAGCGCAGCGACATGCGAGGAGCCCGrGGGCAACGCCCTCArACTTTCTGTATAAATGTACCC 

03<— f— >R  .........  9600 

GCTACTCGCATTGTATTCAGTCCCTCTGCGGAGACGCTGGCAGATTGAGCCCTGGGAGGTtCTCTCCAGCACTAGCAGGTACACCCTGGGTGTTCCCTGCTAGACTCTCACCAGTGCTTG 

GCCSGCACTGGGCAGACGGCTCCACCCTTGCTTGCTTAAAAGACCTCTTAATAAAGCTGCCAGTTAGAAGCA 
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Figure  3.  Alignment  of  the  deduced  envelope  amino  acid  sequences  of  seven  different  HIV-2  and  SIV 
viruses.  The  deduced  amino  acid  sequences  of  the  envelope  open  reading  frames  of  HIV-2/ST  (JSP4-27), 
HIV-2/R0D  (23),  HIV-2/ISY  (18),  HIV-2/NIHz  (54),  HIV-2/GH  (24),  SIV/MAC  (8),  and  SIV/SM  (27)  have  been 
aligned  for  maximum  homology.  The  position  of  the  presumed  primary  envelope  precursor  cleavage  site,  as 
identified  for  HIV-2/R0D  (23),  is  shown.  Filled  circles  indicate  cysteine  residues  conserved  among  all  seven 
envelope  sequences,  while  asterisks  indicate  non-conserved  cysteine  residues.  Potential  N-linked  glycosylation 
sites  (NXS/T)  in  the  HIV-2/ST  envelope  sequence  are  underlined.  Crosses  depict  two  in-frame  stop  codons 
which  are  present  in  the  sequence  of  SIV/MAC  and  HIV-2/GH.  Dots  represent  gaps  which  were  introduced 
for  optimal  sequence  alignment. 
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ST/24  10/10 

ST/24. 1C  1/9 

ST/24. 1C  8/9 

ST/24. 2C  5/10 

ST/24. 2C  5/10 


ST/B12  10/10 

ST/24  10/10 

ST/24. 2C  S/10 
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Figure  4.  Generation  of  fusogenic  and  cytopathic  variants  of  HIV-2/ST.  A  diagram 
illustrating  the  origin  of  biologically-distinct  HIV-2/ST  substrains  is  depicted.  A  total  of  five 
subcultures  were  established  from  the  original  HIV-2/ST  isolate  by  limiting  dilution  cloning.  All 
biologically-cloned  subcultures,  the  transfection-derived  JSP4-27  cell  line,  as  well  as  the  parental 
bulk  culture  produce  non-fusogenic  and  non-cytopathic  progeny  virus  (hatched  boxes).  The  two 
fusogenic  and  cytopathic  variants  ST/24.1  C  and  ST/24.2C,  which  were  generated  by  repeated 
cell-free  passage  of  ST/24  supernatants  to  uninfected  SupTI  cells,  are  depicted  in  black  boxes. 
Also  indicated  is  the  size  of  the  envelope  transmembrane  glycoprotein  (TM)  for  virions  derived 
from  each  culture. 
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Figure  5.  Nucleotide  sequence  variation  in  the  fusion  domain  of  cytopathic  and 
noncytopathic  HIV-2/ST  strains.  230  bps  of  PCR  amplified  envelope  sequence  are  compared 
between  JSP4-27  and  four  other  HIV-2/ST  strains.  The  boundaries  of  the  amplified  fragments 
are  indicated.  The  sequence  of  both  primers  used  to  amplify  the  genomic  DNA  of  ST/BI 2, 
ST/24,  ST/24.1C  and  ST/24.2C  is  underlined,  and  the  bp  changes  which  were  introduced  to 
generate  Bam  HI  and  Pst  I  cloning  sites  are  indicated.  Nucleotide  substitutions  are  shown  with 
respect  to  the  JSP4-27  sequence  and  amino  acid  sequence  changes  unique  for  the  cytopathic 
and  fusogenic  ST/24  strains  are  bold-faced.  Asterisks  depict  the  Thr  and  Ala  substitutions 
previously  identified  to  distinguish  JSP4-27  from  most  other  cytopathic  HIV-2  viruses.  The 
number  of  M13  clones  analyzed  per  HIV-2/ST  strain  is  listed,  with  frequencies  referring  to  the 
proportion  of  clones  which  have  an  identical  sequence. 
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Figure  6.  Amino  acid  sequence  variation  in  the  fusion  domain  of  cytopathic  and  non- 
cytopathic  HIV-2/ST  strains.  The  relative  location  of  the  PCR  amplified  envelope  fragments 
is  shown  in  the  context  of  the  entire  HIV-2  envelope  open  reading  frame.  Amplification 
products  are  depicted  in  the  left  panel.  Lanes  1  and  7,  uninfected  PBL  DNA  (negative  control); 
lane  2,  ST/B12;  lane  3.  ST/24;  lane  4  ST/24.1C;  lane  5,  ST/24.2C,  and  lane  6,  SupTI/LKOOl 
(positive  control).  The  deduced  amino  acid  sequences  of  the  amplified  fusion  regions  are 
compared  to  the  corresponding  sequences  of  HIV-2/ROD  and  JSP4-27.  The  number  of  M13 
clones  analyzed  per  HIV-2/ST  strain  is  listed,  with  frequencies  referring  to  the  proportion  of 
clones  which  have  an  identical  sequence. 
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Emdop*  Amino  Add  Saquano*  Olwganc* 


HIV-2/ST 

HIV-2/ROD 

HIV-2/ISY 

HIV-2/NIHj 

HIV-2/GH 

SIV/MAC,^ 

SIV/SM 

HIV-2/ST 

19% 

17% 

18% 

16% 

28% 

28% 

HIV-2/ROO 

11% 

20% 

19% 

18% 

28% 

28% 

HIV-2/ISY 

10% 

11% 

20% 

19% 

30% 

29% 

HIV-2/NIHj 

12% 

12% 

13% 

19% 

28% 

28% 

HIV-2/GH 

11% 

12% 

12% 

15% 

29% 

29% 

siv/mac1<2 

23% 

23% 

24% 

26% 

24% 

19% 

SIV/SM 

23% 

22% 

23% 

23% 

23% 

15% 

ToM  NuctaoM  Saquanci  DMrgwica 


*  The  percent  nucleotide  sequence  divergence  between  HIV-2/ST  (JSP4-27),  HIV-2/ROD  (22).  HIV-2/ISY  (17).  HIV-2/GH  (Hascgawa  et  al.,  in  press). 
S1VMAC,<2  <*>■ and  S1VSM  (25)  is  shown  along  with  the  percent  amino  acid  sequence  divergence  of  their  envelope  glycoproteins.  Sequences  were  aligned  pairwise, 
using  the  Microgenie  computer  software  (Beckman). 


Table  1 :  Nucleotide  and  amino  acid  sequence  divergence  among  HIV-2  and  SIV  viruses. 

The  %  nucleotide  sequence  divergence  between  HIV-2/ST  (JSP4-27),  HIV-2/ROD,  HIV-2/ISY, 
HIV-2/GH,  SIV/MAC142,  and  SIV/SM  is  shown  along  with  the  %  amino  acid  sequence  divergence 
of  their  envelope  glycoproteins.  Sequences  were  aligned  pairwise  using  the  Microgenie 
Computer  Software  (Beckman). 


31 


Open  reading  frame 

%  Homology 

ST/ROD 

ST/ISY 

ROD/ISY 

Nucleotide 

Amino  acid 

Nucleotide 

Amino  acid 

Nucleotide 

Amino  acid 

LTR 

91.5 

91.4 

91.8 
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pol 
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90.2 

88.4 
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89.9“ 
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89.8“ 

84.8“ 

93.3 

89.5 

tat 

86.7 
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88.7 
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90.5 

80.8 

rev 

82.0 

85.0 

ND* 

ND* 
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env 

85.5 

81.4 

86.0 

83.0 
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nef 

86.1 

78.9 

85.8 

77.7 

87.0 

78.5 

Overall  %  homology 

89.5 

89.9 

89.3 

*  An  in-frame  stop  codon  is  present  in  the  HIV-2/ST  vpr  open  reading  frame. 

*  Meaningful  comparison  of  the  HIV-2/ISY  rev  gene  (17)  with  the  corresponding  rev  genes  of  HIV-2/ST  (JSP4-27)  and  HIV-2/ROD  (22)  was  not  possible  because 
of  considerable  length  differences  between  their  sequences.  ND,  Not  done. 


Table  2:  Sequence  homologies  among  virus-specific  genes  of  three  different  HIV-2 
proviruses.  Nucleotide  and  amino  acid  sequence  homologies  among  different  viral  genes  of 
HIV-2/ST  (JSP4-27),  HIV-2/ROD,  and  HIV-2/ISY  are  shown  in  comparison.  A  meaningful 
comparison  of  the  HIV-2/ISY  rev  gene  to  the  corresponding  rev  genes  of  HIV-2/ST  and  HIV- 
2/ROD  was  not  possible,  because  of  considerable  length  differences  between  their  sequences. 
An  asterisk  indicates  the  presence  an  in-frame  stop  codon  in  the  HIV-2/ST  vpr  open  reading 
frame. 
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Figure  7.  "Nested"  polymerase  chain  reaction  primer  amplification  as  an  approach  to  increase 
sensitivity  of  retroviral  genome  detection. 
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Figure  8.  Nested  HIV-2/SIV,^  pol  primers  with  cloning  enzyme  sites.  Outer  (pol  A  and  D)  and  inner  (pol  B  and  C)  oligonucleotide 
primer  sequences  are  shown  in  shaded  areas  and  are  derived  (rom  best  match  sequence  o(  HIV-2  and  SIV^,  EcoRI  sites  were 
introduced  within  the  inner  primer  sequences  to  facilitate  cloning  of  the  amplified  product.  The  size  of  the  outer  and  inner  amplification 
products  are  555bp  and  337bp,  respectively. 
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Figure  10.  Percent  amino  acid  sequence  Identity  in  a  279bp  PCR  amplified  pol  region  of  primate  lentivlruses.  The  sequences  of  the  four 
West  African  SIV^  viruses  are  indicated  by  The  four  SIV^.^  viruses  differ  from  each  other  by  1-7%  but  from  East  African  viruses  by 

19-22%  and  from  isolates  of  HIV-1  and  HIV-2  by  29-31%. 
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Figure  12.  Molecular  proviral  clones  of  HIV-1  derived  from  uncultured  brain  DNA  of  a  patient  with  AIDS  dementia.  Ten  HIV-1 
proviral  clones  from  a  lambda  phage  genomic  library  of  8  x  10*  recombinant  phage  were  identified  and  mapped.  Because  the 
predominant  viral  forms  in  this  uncultured  brain  DNA  specimen  were  in  unintegrated  circular  form,  a  single  cutter,  EcoRI,  was  selected 
as  the  cloning  enzyme.  Four  of  the  10  clones  were  full-length  with  either  one  or  two  LTRs  (YU-2,  10,  21,  32),  one  done  was  partial 
but  integrated  into  genomic  DNA  (YU-6),  and  the  remaining  6  clones  were  defective  by  virtue  of  deletions  or  rearrangements  (YU-1,  3, 
4.  5,  27). 
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